Clustering of volatility as a multiscale phenomenon 
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The dynamics of prices in financial markets has been studied intensively both experimentally 
(data analysis) and theoretically (models). Nevertheless, a complete stochastic characterization 
of volatility is still lacking. What it is well known is that absolute returns have memory on a 
long time range, this phenomenon is known as clustering of volatility. In this paper we show that 
volatility correlations are power-laws with a non-unique scaling exponent. This kind of multiscale 
phenomenology, which is well known to physicists since it is relevant in fully developed turbulence 
and in disordered systems, is recently pointed out for financial series. Starting from historical returns 
series, we have also derived the volatility distribution, and the results are in agreement with a log- 
normal shape. In our study we consider the New York Stock Exchange (NYSE) daily composite 
index closes (January 1966 to June 1998) and the US Dollar/Deutsch Mark (USD-DM) noon buying 
rates certified by the Federal Reserve Bank of New York (October 1989 to September 1998). 



I. INTRODUCTION 

One of the most challenging problem in finance is the 
stochastic characterization of market returns. This topic 
not only has an academic relevance but it also has an 
obvious technical interest. Think, for example, at the 
option pricing models where distribution and correlations 
of volatility play a central role. 

It is now well established that returns of the most im- 
portant indices and foreign exchange markets have a dis- 
tribution with fat tails, and that they are uncorrelated 
on lags larger than a single day, in agreement with the 
hypothesis of efficient market. On the contrary, the dis- 
tribution of volatility and its correlations are still poorly 
understood. What it is known is that absolute returns 
(which are a measure of volatility) have memory on a 
long time range, this phenomenon is known in finan- 
cial literature as clustering of volatility. Recent studies 
provide a strong evidence for power-law correlations for 
absolute returns |@-||. Notice that in ARCH-GARCH 
approach ]t]-[| volatility memory is longer than a sin- 
gle time step but it decays exponentially, which implies 
that ARCH-GARCH modeling is inappropriate. Indeed, 
GARCH models have been extended in order to take into 
account this long memory properties [^,|To|-p2[| . 

In this paper we analyze the daily returns of the the 
New York Stock Exchange (NYSE) composite index from 
January 1966 to June 1998, and the US Dollar/Deutsch 
Mark (USD-DM) noon buying rates certified by the Fed- 
eral Reserve Bank of New York from October 1989 to 
September 1998. We not only find that volatility corre- 
lations are power-laws on long time scales up to a year for 
NYSE index and six months for USD-DM exchange rate, 
but, more important, that they exhibit a non- unique 
exponent (multiscaling) . This kind of multiscale phe- 
nomenology is known to be relevant in fully developed 
turbulence and in disordered systems |p^|, and it is re- 
cently pointed out for a financial series [14,15). Our re- 



sult is based on the fluctuation analysis of a new class 
of variable that we call generalized cumulative absolute 
returns. 

The second main result of the paper is the study of 
volatility probability distribution, which is derived by 
means of Fourier transform analysis. It is shown that 
it is well approximated by a log-normal distribution for 
NYSE index, while a log-normal shape is a reasonable fit 
only around the maximum for USD-DM rate. 

The paper is organized as follows: in section II we 
show that volatility has a long memory by considering 
the autocorrelation of absolute returns. Nevertheless the 
power-law behavior cannot be inferred by simply consid- 
ering autocorrelations. In order to have a sharper evi- 
dence for the nature of the long memory phenomenon, 
in section III, we perform a scaling analysis on the stan- 
dard deviation of a new class of observables, the gener- 
alized cumulative absolute returns. This analysis implies 
power-law correlations with non- unique exponent. In sec- 
tion IV the attention is focused on volatility probability 
distribution, computed from returns data by means of 
Fourier transform analysis, which turns out to be log- 
normal at least for NYSE index. In section V some final 
remarks can be found. 



II. CORRELATIONS FOR RETURNS 

We consider the New York Stock Exchange (NYSE) 
daily composite index closes (January 1966 to June 1998) 
and the US Dollar/Deutsch Mark (USD-DM) noon buy- 
ing rates certified by the Federal Reserve Bank of New 
York (October 1989 to September 1998). In the first case 
the dataset contains 8180 quotes, in the second 2264. The 
quantity we consider is the (de-trended) daily return, de- 
fined as 
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n = log — (log — — , 
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where St is the index quote or the exchange quote at time 
t. The time t ranges from 1 to N where N is the total 
number of quotes (8180 for the NYSE index and 2264 for 
the USD-DM exchange rate). The notation (•) indicates 
the average over the whole sequence of N data. 

As pointed out by several authors p^Q, the distri- 
bution of returns is leptokurtic. In p7[ , it was firstly 
proposed a symmetric Levy stable distribution and more 
recently in |Q it is argued that the distribution is Levy 
stable except for tails, which are approximately expo- 
nential. The estimation is that the shape of a Gaussian 
is recovered only on longer scales, typically for monthly 
returns. 

Let us introduce the autocorrelation for returns, de- 
fined as 



C(L) = (r t r t+L ) - (r t )(r t+L ) . 



(2) 
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FIG. 1. Autocorrelation C(L, 1) of \rt \ (crosses) as a func- 
tion of the correlation length L, compared with the returns au- 
tocorrelation C(L) (circles), for: a) NYSE index; b) USD-DM 
exchange rate. The scale if fixed by autocorrelation equals to 
1 at L = 0. The data for absolute returns are in agreement 
with a power-law with exponent, respectively, «(1) — 0.38 for 
NYSE index and q(1) ~ 0.39 for USD-DM rate, which are 
derived by the independent scaling analysis. 



A direct numerical analysis of (g) for the NYSE index 
(fig. la) and for USD-DM rate (fig. lb) shows that re- 
turns autocorrelation is a vanishing quantity for all L. 
This simple evidence could induce to the wrong conclu- 
sion that description is complete, i.e. returns are i.i.d. 
variables whose distribution is a truncated Levy. The 
situation is much more complicated, in fact, even if re- 
turns autocorrelation vanishes, one cannot conclude that 
returns are independent variables. Independence implies 
that all functions of returns are uncorrelated variables. 
This is known to be false, in fact volatility have a long 
memory. On the other hand, the daily volatility is not 
directly observable, and informations about it can be de- 
rived by means of absolute returns \r t \. 

It is useful to consider the following autocorrelation for 
powers of absolute returns 



C(L, 1 ) = (\r t \~<\r t+L r)-(\r t r)(\r t+L n 



(3) 



This quantity is plotted for 7 = 1 in fig. la (NYSE 
index) and in fig. lb (USD-DM exchange rate). Unlike 
returns autocorrelation, it turns out to be a non vanishing 
quantity, at least up to L ~ 150 (see flj|^(| and the 
references therein). This is a clear evidence that it is 
not correct to assume returns as independent random 
variables. 

On the other hand, figs. 1 cannot give a satisfactory 
answer about the shape of absolute returns autocorrela- 
tions. In fact, data show a wide spread compatible with 
different scaling hypothesis. In figs. 1 we have reported 
two power-law functions with exponents derived by scal- 
ing analysis, which will be performed in the next section. 
The proposed interpolations are consistent with numeri- 
cal data. 



III. SCALING ANALYSIS 

In the previous section we have seen that, consistently 
with the efficient market hypothesis, daily returns have 
no autocorrelations on lags larger than a single day. This 
fact can be also checked by using of scaling analysis. Con- 
sider the cumulative returns <f>t{L), defined as the sum of 
L successive returns r t , . . . , r t +L-i, divided by L 
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One can define N/L non overlapping variables of this 
type, and compute the associated variance Var (cf>(L)). 
Assuming that r t are uncorrelated (or short range corre- 
lated), it follows that Var (</>(L)) has a power-law behav- 
ior with exponent a — 1 for large L (see Appendix A), 
i.e. 



Var {4>{L)) ~ L~ 



(5) 
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The exponent a both for the NYSE index and USD- 
DM exchange market turns out to be around 1 (see figs. 
2 and also see [^TJ ) , confirming that returns are uncorre- 
cted. 

On the contrary, this is not true for other quantities 
related to absolute returns. In order to perform the ap- 
propriate scaling analysis, let us introduce the general- 
ized cumulative absolute returns defined as the sum of L 



successive returns \r t \ 



kt+L-i| 7 , divided by L 



1 L 

t(L,j) = |r t+i f 



(6) 
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where 7 is a real exponent and, again, these quantities 
are not overlapping. 
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FIG. 2. Variance Var (4>(L, 7)) ~ L' a{l) of the general- 
ized cumulative absolute returns as a function of L on log-log 
scales for 7 = 1 (crosses) and 7 = 1.5 (slanting crosses), com- 
pared with the variance Var (<p(L)) ~ L~ a of the cumulative 
returns (circles), for: a) NYSE index; b) USD-DM exchange 
rate. The exponents of the best fit straight lines (dashed lines) 
are, respectively: a(l) = 0.377T0.005, q(1.5) = 0.526±0.009 
and a = 0.98T0.01 for the NYSE index; q(1) = 0.393T0.016, 
a(1.5) = 0.445 ± 0.023 and a = 0.97 ± 0.01 for the USD-DM 
exchange rate. 
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FIG. 3. Scaling exponent 0(7) of the variance 
Var(<f>(L, 7)) ~ L~ a ^ as a function of 7 for NYSE index 
and USD-DM rate, where the bars represent the errors over 
the best fits. An anomalous scaling (0^(7) < 1) is shown for 
both cases. 

In appendix A we show that if the autocorrelation for 
powers of absolute returns fj^) exhibits a power-law with 
exponent 0(7) < 1 for large L, i.e. C{L, 7) ~ ; it 

would imply that 



Var ML, 7)) 



-0(7) 



(7) 



On the contrary, if the |r t | 7 are short-range correlated 
or power-law correlated with an exponent 0(7) > 1, we 
would not detect anomalous scaling in the analysis of 
variance, i.e. Var (4>{L, 7)) ~ L^ 1 . 

Our numerical analysis shows very sharply an anoma- 
lous power-law behaviour, after a very short transient 
time, in the range up to one year (L — 250) for NYSE 
index (fig. 2a), and up to six months (L — 150) for 
the USD-DM exchange market (fig. 2b). For larger L 
the number of non overlapping variables 4>(L, 7) becomes 
too small for a statistical analysis, as revealed also by the 
increasing fluctuations on variance Var (4>(L, 7)) as func- 
tion of L. The best fit straight lines are performed in the 
range, respectively, 10 < L < 250 for the NYSE index, 
and in the range 10 < L < 150 for the USD-DM rate. 

The crucial result is that 0(7) is not a constant func- 
tion of 7, showing the presence of different anomalous 
scales. The interpretation is that different values of 7 se- 
lect different typical fluctuation sizes, any of them being 
power- law correlated with a different exponent. The case 
7 = corresponds to cumulative logarithm of absolute 
returns. Approximately, in the region 7 > 4 the aver- 
ages are dominated by only few events, corresponding to 
very large returns and, therefore, the statistics becomes 
insufficient. 

In fig. 3, 01(7) is plotted as a function of 7 with error 
bars for both cases. In the NYSE index case, the expo- 
nent 0(7) exhibits a large spread, reaching an ordinary 
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scaling exponent a(j) — 1 for 7 ~ 4. On the contrary, 
the USD-DM exponent turns out to be less variable, ris- 
ing slowly towards a("f) = 1. 

We would like to stress that the scaling analysis in 
figs. 2 definitively proves the power-law behaviour and 
precisely determine the coefficients (3(7), while a direct 
analysis of the autocorrelations (as in figs, f ) would not 
have provided an analogous clear evidence for multiscale 
power-law behaviour, since the data show a wide spread 
compatible with different scaling hypothesis. 

The anomalous power-law scaling can be eventually 
tested against the plot of autocorrelations. For instance, 
the autocorrelations of r t and of \r t \ are plotted in figs. 
1 as a function of the correlation length L, and the full 
line, which is in a good agreement with the data, is not 
a best fit but it is a power-law whose exponent a(l) is 
obtained by the scaling analysis of the variance. 



IV. DISTRIBUTION OF VOLATILITY 

All the discussion in previous section concerns abso- 
lute returns. An obvious question is: 'what is the re- 
lation with volatility?'. The answer is not completely 
trivial, since from an operative point of view, the volatil- 
ity is often assumed to coincide with the intra-day abso- 
lute cumulative return or, alternatively, with the implied 
volatility which can be extracted from option prices. 

Our point of view is that the exact definition of volatil- 
ity cannot be independent from the theoretical frame- 
work. It is usually assumed that the volatility a t is de- 
fined by 



rt 



(8) 



where the u>t are identically distributed random variables 
with vanishing average and unitary variance. The usual 
choice for the distribution of the lo± is the normal Gaus- 
sian. This picture is completed by assuming the proba- 
bilistic independence between a t and Wf 

In other terms, the returns series can be considered as 
a realization of a random process based on a zero mean 
Gaussian, with a standard deviation a t that changes at 
each time step. According to the above definition, all 
the scaling property we have found on absolute returns 
directly apply to volatility. 

Volatility at is an hidden variable, since we can directly 
evaluate only daily returns. Nevertheless, in appendix B 
we show how to derive the volatility probability distribu- 
tion p{a) starting from the returns series. The key point 
is to move the problem in the space of the characteristic 
functions (Fourier transforms). 

The probability distribution p{a) is plotted in fig. 4, 
both for the NYSE index and the USD-DM exchange 
rate. The results corresponding to extreme values of 
volatility (a ~ and a ~ 0.02) are not confident due 
to insufficient statistics. 
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FIG. 4. Probability distribution p(cr) of volatility for 
NYSE index (circles) and USD-DM exchange rate (crosses), 
fitted by log-normal distributions (^|) with, respectively, 
m = -4.94 ± 0.01 and s = 0.44 ± 0.01 for the NYSE in- 
dex (fit performed in the range 0.0035 < a < 0.01), and 
m = -5.27 ± 0.01 and s = 0.54 ± 0.01 for the USD-DM rate 
(fit range 0.0025 < a < 0.005). 

The astonishing fact is that NYSE volatility distribu- 
tion is well fitted by a log-normal distribution |20|j22[| 



p(a) = 



1 



2n s a 



(9) 



The fit is performed in the range 0.0035 < a < 0.01 
and gives m = -4.94 ± 0.01 and s = 0.44 ± 0.01, while 
the USD-DM volatility distribution is consistent with a 
log-normal distribution with m — — 5.27± 0.01 and s = 
0.54 ±0.01 only in a narrow region around the maximum 
(0.0025 < a < 0.005). 

This unexpected log-normal shape for the volatility 
distribution suggests the existence of some underling 
multiplicative process for volatility, at least for the NYSE 
index. This result implies that not only indices prices are 
multiplicative processes, but also the associated returns. 
On the other hand, the USD-DM rate analysis might 
be affect by insufficient statistics problems, which leads 
to an over-estimation of distribution tail in the range 
a ~ 0.01. Under this hypothesis, a log-normal shape 
could be consistent with the USD-DM volatility distri- 
bution, and an underling multiplicative process might be 
present also for foreign exchange returns. 

A possible and reasonable tentative to explain this pe- 
culiar behaviour for the volatility distribution can be 
found in |p3| , where a multiplicative cascade process for 
volatility is proposed, borrowing well-known arguments 
from turbulence theory. 
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V. CONCLUSIONS 



which leads to 



The first result we have found is that the scaling of 
variance of the generalized cumulative absolute returns is 
power-law with non-unique exponent, for both the NYSE 
daily index and the USD-DM exchange rate. This fact 
implies power-law correlations whose exponent depends 
on the variable which is considered. The main theoretical 
consequence is that models with exponential correlations, 
like ARCH-GARCH, fails in describing the dynamics of 
financial markets, and that new models should account 
for the coexistence of long memory with different scales. 

The second result is that volatility distribution is log- 
normal, at least for NYSE index. This fact suggests that 
volatility itself evolves as a multiplicative process. 

These two results show the existence of an underling 
process that drives daily returns, and indicates that new 
modelizations of financial markets have to look to returns 
as a subordinate process of volatility. 
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APPENDIX A 

In this appendix we show that if the correlations 
C(L,j) exhibit a long range memory, C(L, 7) ~ L~ a ^\ 
then also the variance Var(<fi(L, 7)) of the generalized cu- 
mulative absolute returns behaves at large L as L - "^. 

The explicit expression of variance is 

L L 

varMm)) = ^^{|r t+i nr t+i r)-(in +< r)(ir t+j r: 

1=1 j=l 

Taking into account that r t is a stationary process, and 
using the definition of C(L,j) ©, one has: 

yar(^(L, 7 )) = ic(0, 7 ) + ^ £ C(i-j >7 ) 

L>i>j>l 

where 

c(o, 7 ) = (N 27 )-(N 7 ) 2 ■ 

The previous expression can be rewritten as 

1 2 
Var ML, 7)) = 7 C(0, 7) + ]T (L - i) C(i, 7) . 



Under the hypothesis C(L, 7) ~ L one has for large 

L 



L-l 



— J2(L-i) C(i, 7 )~L- 



1(7) 



Var (<f>{L, 7)) = OiL- 1 ) + 0(L- a ^) . 
For our data a(j) < 1, and then 

Var ML, 7)) ~ L _a(7) . 

On the contrary, if 0(7) > 1 or worst, correlations 
exhibit a faster decay, the variance Var{(f)(L, 7)) would 
be a power-law with scaling exponent equals to 1. 

A similar sketch can be repeated for the cumulative 
returns <j){L). In this case since correlation has a fast 
decay, we have 

Var(0(L, 7 ))^L- 1 . 

APPENDIX B 

Let us introduce the variables TZ t , S t , , defined as 

Tit =log|r t | 
S t = log a t 
W t = log|w t | 

which are related among them by virtue of (g) by 
K t = S t + W t • 

For the associated probability distributions (respectively 
Q(Tl), P(S), T(W)) the following relation holds 



Q(TZ) 



+ OO 



dS P(S)T(R - S) 



(10) 



The distribution P(S) retains full information on the 
volatility probability distribution p(a), since p{a) — 
P(loga)/a. 

In order to derive from ( |To| ) an explicit expression 
for P(S), it is convenient to consider the characteris- 
tic functions (Fourier transforms) Q(TZ), P(S),T(W) of 
Q{K), P(S), T(W). In fact, the following simple relation 
holds 

Q(S) = P(S)f(S) 
and the inverse Fourier transform gives 



-Hoc. 



ds m e«* 

T(S) 



Notice that Q(S) and T(S) are complex objects, but we 
may consider only the real part of the integrand, since 
the result of the integration has to be real 



P(S) 



1 

2^ 



r+00 

/ dS Re 


Q(<5) e iss 


/ —OO 


T(S) 



(11) 
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where 



Re 



f(S) 



(ReQ Ref + ImQ Imf) cos(SS) 



(ReT) 2 + (ImT) 2 



(ReQ Imf - ImQ Ref) sin(SS) 
+ (Ref) 2 + (Imf) 2 

From a practical point of view, ReQ(<S) and ImQ(tS) 
can be directly computed from the returns series 

r+oo -, N 

ReQ (5) = / dKQ(K) cos(SK) ~ — ^cos(Sft t ) 



dKQ(K) sin(STZ) ~ — ^sin(5^ t ) . 

-°° t=i 

The Fourier transforms Ref (5) and Imf (5) can be 
evaluated numerically starting from their definitions: 



ReT(S) 



Imf (S) 



dTZ T(K) cos(SK) 



dUTCll) sin(S"K) 



where 



T(K) 



Finally, the probability distribution P(S), and thenp(er), 
can be computed via the numerical evaluation of integral 
©• 

The key step of this procedure is the numerical inverse 
Fourier transform, therefore the delicate point is the eval- 
uation of the tails of the probability distribution P(S), 
where the limited number of data leads to spurious fluc- 
tuations. 
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